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INTEIN-MEDIATED PROTEIN LIGATION OF 
EXPRESSED PROTEINS 

RELATED APPLICATIONS 

This Application is a Continuation-ln-Part of U.S. S.N. 
08/811,492, fileck March 5, 1997 now U.S. Patent No. 
5,834,247, issued ^November 10, 1998, entitled "Modified 
Proteins Comprising\Controllable Intervening Protein 
Sequences Or Their Elements Methods of Producing Same and 
Methods For Purification Of A Target Protein Comprised By A 
Modified Protein", and of ll^S-N. 60/102,413, filed September 
30, 1998, entitled "Intein Mediated Peptide Ligation." 

BACKGROUND OF THE INVENTION 

The present invention relates to methods of intein- 
mediated ligation of proteins. More specifically, the present 
invention relates to intein-mediated ligation of expressed 
proteins containing a predetermined N-terminal residue 
and/or a C-terminal thioester generated via use of one or 
more naturally occurring or modified inteins. Preferably, the 
predetermined residue is cysteine. 



Inteins are the protein equivalent of the self-splicing 
RNA introns (see Perler et al., Nucleic Acids Res. 22:1125- 
1127 (1994)), which catalyze their own excision from a 
precursor protein with the concomitant fusion of the flanking 
protein sequences, known as exteins (reviewed in Perler et al., 
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Curr. Opin. Chem. Biol. 1:292-299 (1997); Perler, F. B. Cell 
92(1):1-4 (1998); Xu et a\.,EMBO J. 1 5(1 9):51 46-51 53 (1996)). 



Studies into the mechanism of intein splicing led to the 
development of a protein purification system that utilized 
thiol-induced cleavage of the peptide bond at the N-terminus 
of the See VMA intein (Chong et a!., Gene 1 92(2):271 -281 
(1997)). Purification with this intein-mediated system 
generates a bacterially-expressed protein with a C-terminal 
thioester (Chong et al., (1997)). In one application, where it is 
described to isolate a cytotoxic protein, the bacterially 
expressed protein with the C-terminal thioester is then fused 
to a chemically-synthesized peptide with an N-terminal 
cysteine using the chemistry described for "native chemical 
ligation" (Evans et al., Protein Sci. 7:2256-2264 (1998); Muir 
et al., Proc. Natl. Acad. Sci. USA 95:6705-6710 (1998)). 

This technique, referred to as "intein-mediated protein 
ligation" (IPL), represents an important advance in protein 
semi-synthetic techniques. However, because chemically- 
synthesized peptides of larger than about 100 residues are 
difficult to obtain, the general application of IPL is limited 
by the requirement of a chemically-synthesized peptide as a 
ligation partner. 



IPL technology would be significantly expanded if an 
expressed protein with a predetermined N-terminus, such as 
cysteine, could be generated. This would allow the fusion of 
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one or more expressed proteins from a host cell, such as 
bacterial, yeast or mammalian cells. 

One method of generating an N-terminal cysteine is with 
the use of proteases. However, proteases have many 
disadvantages, such as the possibility of multiple protease 
sites within a protein, as well as the chance of non-specific 
degradation. Furthermore, following proteolysis, the 
proteases must be inactivated or purified away from the 
protein of interest before proceeding with IPL. (Xu, et al., 
Proc. Natl. Acad. Sci. USA 96(2):388-393 (1999) and Erlandson, 
et al., Chem. Biol., 3:981-991 (1996)) 

There is, therefore, a need for an improved inter- 
mediated protein ligation method which overcomes the noted 
limitations of current IPL methods and which eliminates the 
need for use of proteases to generate an N-terminal cysteine 
residue. Such an improved IPL method would have widespread 
applicability for the ligation of expressed proteins, for 
example, labeling of extensive portions of a protein for, among 
other things, NMR analysis. 

SUMMARY OF THE INVENTION 

In accordance with the present invention, there is 
provided a method for the ligation of expressed proteins 
utilizing one or more inteins which display cleavage at their 
N- and/or C-termini. In accordance with the present 
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invention, such inteins may occur either naturally or may be 
■ modified to cleave at their N- and/or C-termini. Inteins 
displaying N- and/or C-terminal cleavage enable the facile 
isolation of a protein having a C-terminal thioester and a 
protein having an N-terminal amino acid residue such as 
cysteine, respectively, for use in the fusion of one or more 
expressed proteins. Alternatively, the method may be used to 
generate a single protein having both a C-terminal thioester 
and a specified N-terminal amino acid residue, such as 
cysteine, for the creation of cyclic or polymerized proteins. 
These methods involve the steps of generating at least one C- 
terminal thioester-tagged first target protein, generating at 
least one second target protein having a specified N-terminal 
amino acid residue, for example cysteine, and ligating these 
proteins. This method may be used where a single protein is 
expressed, where, for example, the C-terminal thioester end 
of the protein is fused to the N-terminal end of the same 
protein. The method may further include chitin-resin 
purification steps. 

In one preferred embodiment the intein from the RIR1 
Methanobacterium thermoautotrophicum is modified to cleave 
at either the C-terminus or N-terminus. The modified intein 
allows for the release of a bacterially expressed protein 
during a one-column purification, thus eliminating the need 
proteases entirely. DNA encoding these modified inteins and 
plasmids containing these modified inteins are also provided 
by the instant invention. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a diagram depicting both the N-terminal and 
C-terminal cleavage reactions which comprise inter- 
mediated protein ligation. The modified Mth RIR1 intein was 
used to purify both MBP with a C-terminal thioester and T4 
DNA ligase with an N-terminal cysteine. The Mth RIR1 intein 
for N-terminal cleavage, intein(N), carried the P-1G/N 134 A 
double mutation. The full length fusion protein consisting of 
MBP-intein(N)-CBD was separated from cell extract by binding 
the CBD portion of the fusion protein to a chitin resin. 
Overnight incubation in the presence of 100 mM 2- 
mercaptoethanesulfonic acid (MESNA) induced cleavage of the 
peptide bond prior to the N-terminus of the intein and created 
a thioester on the C-terminus of MBP. The C-terminal 
cleavage vector, intein(C), had the P-iG/CU double mutation. 
The precursor CBD-intein(C)-T4 DNA ligase was isolated from 
induced E. coli cell extract by binding to a chitin resin as 
described for N-terminal cleavage. Fission of the peptide bond 
following the C-terminal residue of the intein at a preferred 
temperature and pH resulted in the production of T4 DNA 
ligase with an N-terminal cysteine. Ligation occurred when 
the proteins containing the complementary reactive groups 
were mixed and concentrated, resulting in a native peptide 
bond between the two reacting species. 
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Figure 2A is a gel depicting the purification of a C- 
terminal thioester-tagged maltose binding protein (MBP) via a 
thiol-inducible Mth RIR1 intein construct pMRB10G (containing 
the modified intein, R(N), with P-1G/N134A mutation) and the 
purification of T4 DNA ligase having an N-terminal cysteine 
using the vector pBRL-A (containing the modified intein, R(C), 
with P- 1 G/C 1 A mutation). Lanes 1-3, purification of maltose 
binding protein (MBP) (M, 43 kDa) with a C-terminal thioester. 
Lane 1. ER2566 cells transformed with plasmid pMRB10G 
following Isopropyl B-D-thiogalactopyranoside (IPTG) 
induction. Lane 2. Cell extract after passage over a chitin 
resin. Note that the fusion protein, M-R(N)-B, binds to the 
resin, where B is the chitin binding domain. Lane 3. Fraction 
3 of the elution from the chitin resin following overnight 
incubation at 4°C in the presence of 100 mM MESNA. Lanes 4- 
6, purification of T4 DNA ligase (L, 56 kDa) with an N- 
terminal cysteine. Lane 4. IPTG induced ER2566 cells 
containing plasmid pBRL-A. Lane 5. Cell extract after 
application to a chitin resin. B-R(C)-L, the fusion protein, 
binds to the resin. Lane 6. Elution of T4 DNA ligase with an 
N-terminal cysteine after overnight incubation at room 
temperature in pH 7 buffer 

Figure 2B is a gel depicting ligation of T4 DNA ligase 
having an N-terminal cysteine to a C-terminal thioester 
tagged MBP. Lane 1. Thioester-tagged MBP. Lane 2. T4 DNA 
ligase with an N-terminal cysteine. Lane 3. Ligation reaction 
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of MBP (0.8 mM) with T4 DNA ligase (0.8 mM), generating M-L, 
after overnight incubation at 4°C. 

Figure 3 is a gel depicting the effect of induction 
temperature on the cleaving and/or splicing activity of the 
Mth RIR1 intein or Mth RIR1 intein mutants. The Mth RIR1 
intein or mutants thereof, with 5 native N- and C-terminal 
extein residues were induced at either 15°C or 37°C. The 
intein was expressed as a fusion protein (M-R-B, 63 kDa) 
consisting of N-terminal maltose binding protein (M, 43 kDa), 
the Mth RIR1 intein (R, 15 kDa) and at its C-terminus was the 
chitin binding domain (B, 5 kDa). Lanes 1 and 2. M-R-B with 
the unmodified Mth RIR1 intein. Note the small amount of 
spliced product (M-B, 48 kDa). Lanes 3 and 4. Mth intein with 
Pro- 1 replaced with Ala, M-R-B(P" 1 A). Both spliced product 
(M-B) and N-terminal cleavage product (M) are visible. Lanes 5 
and 6. Replacement of Pro- 1 with Gly (M-R-B(P" 1 G)) showed 
some splicing as well as N- and C-terminal cleavage, M and M- 
R, respectively. Lanes 7 and 8. The Pro* 1 to Gly and Cys 1 to 
Ser double mutant, M-R-B(P- 1 G/C 1 S), displayed induction 
temperature dependent C-terminal cleavage (M-R) activity. 
Lanes 9 and 10. The M-R-B(P" 1 G/N 1 34a) mutant possessed 
only N-terminal cleavage activity producing M. The Mth intein 
or Mth intein -CBD fusion is not visible in this Figure. 

Figure 4 is a nucleotide sequence (SEQ ID NO:23) 
comparison of wild type Mth RIR1 intein and synthetic Mth 
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RIR1 intein indicating the location of 61 silent base mutations 
designed to increase expression in E. coli. DNA alignment of 
the wild type Mth RIR1 intein (top strand) and the synthetic 
Mth RIR1 intein (bottom strand). To increase expression 
levels in E. coli, 61 silent base changes were made in 49 
seperate codons when creating the synthetic gene. The first 
and last codons of the wild type Mth RIR1 intein are shown in 
bold. 

DETAILED DESCRIPTION 

The present invention provides a solution to the 
limitations of current intermediated ligation methods by 
eliminating the need for a synthetic peptide as a ligation 
partner, and providing a method which is suitable for the 
fusion one or more expressed proteins. 

In general, any intein displaying N- and/or C-terminal 
cleavage at its splice junctions can be used to generate a 
defined N-terminus, such as cysteine as well as a C-terminal 
thioester for use in the fusion of expressed proteins. Inteins 
which may be used in practicing the present invention include 
those described in Perler, et al., Nucleic Acids Res., 
27(1):346-347 (1999). 

In accordance with one preferred embodiment, an intein 
found in the ribonucleoside diphosphate reductase gene of 
Methanobacterium thermoautotrophicum (the Mth RIR1 intein) 
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was modified for the facile isolation of a protein with an N- 
terminal cysteine for use in the in vitro fusion of two 
bacterially-expressed proteins. The 134-amino acid Mth RIR1 
intein is the smallest of the known mini-inteins, and may be 
close to the minimum amino acid sequence needed to promote 
splicing (Smith et.al., J. Bacteriol. 179: 7135-7155 (1997)). 

The Mth RIR1 intein has a proline residue on the N- 
terminal side of the first amino acid of the intein. This 
residue was previously shown to inhibit splicing in the See 
VMA intein (Chong et al., J. Biol. Chem. 273:10567-10577 
(1998)). The intein was found to splice poorly in E. coli when 
this naturally occurring proline is present. Splicing 
proficiency increases when this proline is replaced with an 
alanine residue. Constructs that display efficient N- and C- 
terminal cleavage are created by replacing either the C- 
terminal asparagine or N-terminal cysteine of the intein, 
respectively, with alanine. 

These constructs allow for the formation of an intein- 
generated C-terminal thioester on a first target protein and 
an intein-generated N-terminal cysteine on a second target 
protein. These complementary reactive groups may then be 
ligated via native chemical ligation to produce a peptide bond 
(Evans et al supra (1998), Muir et al supra (1998)). 
Alternatively, a single protein containing both reactive groups 
may be generated for the creation of cyclic or polymerized 
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proteins. Likewise, more than one first or second target 
proteins may be generated via use of multiple mutant inteins. 

As used herein, the terms fusion and ligation are used 
interchangeably. Also as used herein, protein shall mean any 
protein, fragment of any protein, or peptide capable of ligation 
according to the methods of the instant invention. Further, as 
used herein, target protein shall mean any protein the ligation 
of which, according to the methods of the instant invention, is 
desired. 

The general method of intein-mediated protein ligation 
in accordance with the present invention is as follows: 

(1) An intein of interest is isolated and cloned into an 
appropriate expression vector(s) such as bacterial, plant, 
insect, yeast and mammalian cells. 

(2) The intein is engineered for N- and/or C-terminal 
cleavage unless the wild type intein displays the desired 
cleavage activities. In a preferred embodiment, a modified 
intein with the desired cleavage properties can be generated 
by substituting one or more residues within and/or flanking 
the intein sequence. For example, a modified intein having N- 
terminal cleavage activity can be created by changing the last 
intein residue. Alternatively, a modified intein with C- 
terminal cleavage activity can be created by changing the 
first intein residue. 
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(3) The intein with N- and/or C-terminal cleavage 
activity is fused with an affinity tag to allow purification 
away from other endogenous proteins. 

(4) The intein or inteins, either wild type or modified, 
that display N-terminal and/or C-terminal cleavage, or both, 
are fused to the desired target protein coding region or 
regions upstream and/or downstream of the intein. 

(5) An intein that cleaves at its N-terminus in a thiol 
reagent dependent manner is used to isolate a protein with a 
C-terminal thioester. This cleavage and isolation is, for 
example, carried out as previously described for the See VMA 
and Mxe GyrA inteins (Chong et al., Gene 192(2):271 -281 
(1997); Evans et al., Protein Sci. 7:2256-2264 (1998)). As 
discussed previously, multiple C-terminal thioester-tagged 
proteins may be generated at this step . 

(6) A target protein having a specified N-terminus is 
generated by cleavage of a construct containing an intein that 
cleaves at its C-terminus. The specified N-terminal residue 
may be any of the amino acids, but preferably cysteine. As 
discussed previously, this step may alternately generate a 
specified N-terminal on the same protein containing a C- 
terminal thioester, to yield a single protein containing both 
reactive groups. Alternatively, multiple proteins having the 
specified N-terminus may be generated at this step. 
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(7) Thioester-tagged target protein and target protein 
having a specified N-termini are fused via intermediated 
protein ligation (IPL) (see Figure 2B). In a preferred 
mbodiment, the N-terminus is cysteine. Alternatively, a 
ngle protein containing both a C-terminal thioester and a 
specified N-terminus, such as a cysteine, may undergo 
intramolecular ligation to yield a cyclic product and/or 
intermolecular ligation to yield polymerized proteins. 

The methodology described by the instant invention 
significantly expands the utility of current IPL methods to 
enable the labeling of extensive portions of a protein for NMR 
analysis and the isolation of a greater variety of cytotoxic 
proteins. In addition, this advance opens the possibility of 
labeling the central portion of a protein by ligating three or 
more fragments. 

The use of an intein or inteins with N-terminal and C- 
terminal cleavage activity provides the potential to create a 
defined N-terminus, such as a cysteine, and a C-terminal 
thioester on a single protein. The intramolecular ligation of 
the resulting protein generates a circular protein, whereas the 
intermolecular ligation of several of these proteins generates 
a protein polymer. 

Cleavage at the N- and/or the C-terminus of an intein 
can be brought about by introducing changes to the intein 
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and/or its extein sequences. A,so, naturally ooouring interns 
may display these properties and require no manipulation. 
Cleavage at the N- and/or C-.erminus of an in.ein can occur 
uncontrollably or induced using nucleophilc compounds, such 
as thiol reagents, temperature. pH. salt, chaotropic agents, or 
any combination of the aforementioned conditions and/or 
reagents. 

The Examples presented below are only intended as 
specific preferred embodiments of the present invention and 
are not intended to limit the scope of the invention except as 
provided in the claims herein. The present invent.cn 
encompasses modifications and variations of the methods 
tau gh. herein which would be obvious to one o, ordinary sK.II 
in the art. 

The references cited above and below are herein 
incorporated by reference. 

EXAMPLE I 

Creation of the Mth RIR1 synthetic gene 

The gene encoding the Mth RIR1 intein along with 5 
native N- and C-extein residues (Smith et al. supra (1997)) 
was constructed using 10 oligonucleotides (New England 
Biolabs, Beverly, MA) comprising both strands of the gene, as 

follows: 
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1 ) 5'-TCGAGGCAACCAACCCCTGCGTATCCGGTGACACCATTGT 
AATGACTAGTGGCGGTCCGCGCACTGTGGCTGAACTGGAG 
GGCAAACCGTTCACCGCAC-3' (SEQ ID NO:1) 

5 2) 5'-CCGGTTGGCTGCTCGCCACAGTTGTGTACAATGAAGCCAT 

TAGCAGTGAATGCGCTAGCACCGTAAACAGTAGCGTCATA 
AACATCCTGGCGG-3' (SEQ ID NO:2) 

3 ) 5'-pTGATTCGCGGCTCTGGCTACCCATGCCCCTCAGGTTTCTT 
1 0 CCGCACCTGTGAACGTGACGTATATGATCTGCGTACACGT 

GAGGGTCATTGCTTACGTTT-3' (SRQ ID NO:3) 

4 ) 5'-pGACCCATGATCACCGTGTTCTGGTGATGGATGGTGGCCTG 
GAATGGCGTGCCGCGGGTGAACTGGAACGCGGCGACCGCC 

15 TGGTGATGGATGATGCAGCT-3' (SEQ ID NO:4) 

5 ) 5'-pGGCGAGTTTCCGGCACTGGCAACCTTCCGTGGCCTGCGTG 
GCGCTGGCCGCCAGGATGTTTATGACGCTACTGTTTACGG 
TGCTAGC-3' (SEQ ID NO:5) 



20 



6 ) 5'-pGCATTCACTGCTAATGGCTTCATTGTACACAACTGTGGCG 
AGCAGCCAA-3' (SEQ ID NO:6) 



7 ) 5'-pCCAGCGCCACGCAGGCCACGGAAGGTTGCCAGTGCCGGAA 
2 5 ACTCGCCAGCTGCATCATCCATCACCAGGCGGTCGCCGCG 

TTCCAGTTCACCCGCGGCAC-3' (SEQ ID NO:7) 

8 ) 5'-pGCCATTCCAGGCCACCATCCATCACCAGAACACGGTGATC 
ATGGGTCAAACGTAAGCAATGACCCTCACGTGTACGCAGA 

30 TCATATACGT-3' (SEQ ID NO:8) 

9 ) 5'-pCACGTTCACAGGTGCGGAAGAAACCTGAGGGGCATGGGTA 
GCCAGAGCCGCGAATCAGTGCGGTGAACGGTTTGCCCTCC 
AGTTCAGCCACAGTGCG-3' (SEQ ID NO:9) 
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t 0) 5'-pCGGACCGCCACTAGTCATTACAATGGTGTCACCGGATACG 
CAGGGGTTGGTTGCC-3' (SEQ ID NO:10) 

To ensure maximal £ «/f expression, the coding region 
of the synthetic Mth RIR1 intein incorporates 61 silent base 
mutations in 49 of the 134 codons (see Figure 4) in the 
wildtype Mth RIR1 intein gene (GenBank AE000845). The 
oligonucleotides were annealed by mixing at equimolar rat.os 
(400 nM) in a ligation buffer (50 mM Tris-HCI, pH 7.5 
containing 10 mM MgCI 2 , 10 mM dithiothreitol, 1 mM ATP, and 
25 M BSA) followed by heating to 95°C. After cooling to room 
temperature, the annealed and ligated oligonucleotides were 
inserted into the XM and Agel sites o. pMYB5 (NEB), replacing 
the See VMA intein and creating the plasmid pMRBSP. 

Engineering the ««. RIR1 intein «or N- and C-.ermina. 

* cleavage 

The unique Xnol and Spel sites flanking the N-terminal 
splice junction and the unique BsrG\ and Agei sites flankmg 
,he C-terminal splice iunction allowed substitution of ammo 
acid residues by linker replacement. The proline residue, Pre 
1 preceding the intein in pMRBSP was substituted w.th 
alanine or glycine to yield pMRB8A and pMRBBGI. respectively. 
Substitution of Pro-1-Cysi with Gly-Ser or Gly-Ala yielded 
PMRB9GS and pMRB9GA, respectively. Replacing Asn^ w,th 
Ala in PMRB8G1 resulted in pMRBIOG. The following linkers 
wore used for substitution of the native amino acds at the 
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splice junctions (each linker was formed by annealing two 
synthetic oligonucleotides as described above): 

iA ,. , c-.TCGAGGCAACCAACGCATGCGTATCCGGT 
hnker: IJcaCCATTGTAATGA-S' (SEQ ID NO:11) 

* 5'-CTAGTCATTACAATGGTGTCACCGGATAC 
and GC^TGCGTTGGTTGCC-3' (SEQ ID NO:12) 

P-1C linker- 5'-TCGAGGGCTGCGTATCCGGTGACACCATT 
10 P G ,mker " GTAATGA-3 (SEQ ID NO:13)' 

anH 5'-CTAGTCATTACAATGGTGTCACCGGATAC 
GCAGCCC-3' (SEQ ID NO:14) 

(SEQ ID NO:15) 
j 5'-CTAGTCATTACAATGGTGTCACCGGATAC 

20 and |gatccgttggttgcctcgatgccc-3' 

(SEQIDNO:16) 



p-ir/riA linker- 5'-TCGAGGGCATCGAGGCAACCAACGGCGCC 
P 1 G/C 1 A linker. GTATCCGGTGACACCATTGTAATGA -3 

(SEQ ID NO: 17) 

5'-CTAGTCATTACAATGGTGTCACCGGATAC 
3nd GGCGCCGTTGGTTGCCTCGATGCCC-3- 



N"I3 4 A linker: 5' 

GA-3' 



(SEQ IDNO-.18) 

GTACACGCATGCGGCGAGCAGCCCGG 

A-3' 
(SEQ ID NO: 19) 
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and 5'-CCGGTCCCGGGCTGCTCGCCGCATGC 
GT-3' 

(SEQ ID NO:20) 

pBRL-A was constructed by substituting the Escherichia 
coli maltose binding protein (MBP) and the Bacillus circulans 
chitin binding domain (CBD) coding regions in pMRB9GA with 
the CBD and the T4 DNA ligase coding regions, respectively, 
subcloned from the pBYT4 plasmid. 

EXAMPLE II 
Generating a thioester-tagged protein: 

The pMRB10G construct from Example I contains the Mth 
RIR1 intein engineered to undergo thiol reagent induced 
cleavage at the N-terminal splice junction (Figure 1, N- 
terminal cleavage) and was used to isolate proteins with a C- 
terminal thioester as described previously for the See VMA 
and Mxe GyrA inteins (Chong et al. supra 1997); Evans et al., 
supra (1998)). Briefly, ER2566 cells (Evans et.al. (1998)) 
containing the appropriate plasmid were grown at 37°C in LB 
broth containing 100 jig/mL ampicillin to an OD600 of 0.5-0.6 
followed by induction with IPTG (0.5 mM). Induction was 
either overnight at 15°C or for 3 hours at 30°C. 

The cells were pelleted by centrifugation at 3,000xg for 
30 minutes followed by resuspension in buffer A (20 mM Tris- 
HCI, pH 7.5 containing 500 mM NaCI). The cell contents were 
released by sonication. Cell debris was removed by 
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centrifugation at 23 ( 000xg for 30 minutes and the supernatant 
was applied to a column packed with chitin resin (10 mL bed 
volume) equilibrated in buffer A. Unbound protein was washed 
from the column with 10 column volumes of buffer A. 

Thiol reagent-induced cleavage was initiated by rapidly 
equilibrating the chitin resin in buffer B (20 mM Tris-HCI. P H 
8 containing 500 mM NaCl and 100 mM 2-mercaptoethane- 
sulfonic acid (MESNA)). The cleavage reaction, which 
simultaneously generates a C-terminal thioester on the target 
protein, proceeded overnight at 4°C after which the protein 
was eluted from the column. The use of the pMRB10G 
construct resulted in the isolation of MBP with a C-terminal 
thioester (Figure2A). 



Isolating proteins with an N-terminal cysteine 

The pBRL-A construct from Example I contains an Mth 
RIR1 intein engineered to undergo controllable cleavage at its 
C-terminus, and was used to purify proteins with an N- 
terminal cysteine (Figure 1, C-terminal cleavage). The 
expression and purification protocol was performed as 
described in Example II, except with buffer A replaced by 
buffer C (20 mM Tris-HCI, P H 8.5 containing 500 mM NaCl) and 
buffer B replaced by buffer D (20 mM Tris-HCI, pH 7.0 
containing 500 mM NaCl). Also, following equilibration of the 
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column in buffer D the cleavage reaction proceeded overnight 
at room temperature. 

The expression of plasmid pBRL-A resulted in the 
purification of 4-6 mg/L cell culture of T4 DNA ligase 
possessing an N-terminal cysteine (Figure 2A). Protein 
concentrations were determined using the Bio-Rad protein 
assay (Bio-Rad Laboratories, Inc., Hercules, CA). 

EXAMPLE III 

Protein-protein ligation using Intein-mediated 

Protein Ligation 

Intein-mediated protein ligation (IPL) was used to fuse 
two proteins (Figure 2B). Freshly isolated thioester-tagged 
protein from Example II was mixed with freshly isolated 
protein containing an N-terminal cysteine residue from 
Example II, with typical starting concentrations of 1-200 |aM. 
The solution was concentrated with a Centriprep 3 or 
Centriprep 30 apparatus (Millipore Corporation, Bedford, MA) 
then with a Centricon 3 or Centricon 10 apparatus to a final 
concentration of 0.15-1.2 mM for each protein. 

Ligation reactions proceeded overnight at 4°C and were 
visualized using SDS-PAGE with 12% Tris-glycine gels (Novex 
Experimental Technology, San Diego, CA) stained with 
Coomassie Brilliant Blue. Typical ligation efficiencies ranged 
from 20-60%. 



^ DockeWo.: NEB-154 

- 20 - 



Confirmation of ligation in IPL reactions 

A Factor Xa site in MBP that exists 5 amino acids N- 
terminal from the site of fusion (Maina et el. supra (1988)) 
allowed amino acid sequencing through the ligation juncon. 
The sequence obtained was NH2-TLEGCGEQPTGXLK-COOH (SEQ 
ID NO-21) which matched the last 4 residues of MBP (TLEG) 
followed by a linker sequence (CGEQPTG (SEQ ID NO:22» and 
the start of T4 DNA ligase (ILK). During amino acid 
sequencing, the cycle expected .0 yield an isoleucine did no, 
have a strong enough signal to assign it to a specific res.due, 
so it was represented as an X. The cysteine was identrfied as 
the acrylamide alkylation product. 

The Factor Xa proteolysis was performed on 2 mg of 
ligation reaction involving MBP and T4 DNA ligase. This 
reaction mixture was bound to 3 mL of amylose resin (New 
England Biolabs, Inc., Beverly, MA) equilibrated in buffer A 
(see Example II). Unreacted T4 DNA ligase was rinsed from 
, he column with 10 column volumes of buffer A. Unliga.ed 
MBP and the MBP-T4 DNA ligase fusion protein were eluted 
from the amylose resin using buffer E (20 mM Tris-HCI, pH 7.5 
containing 500 mM NaCI and 10 mM maltose). Overmgh. 
incubation of the eluted protein with a 200:1 pro,ein:bov,ne 
Factor Xa (NEB) ratio (w/w) a. 4°C resulted in the proteolys.s 
of the fusion protein and regeneration of a band on SDS-PAGE 
geis that ran a, a molecular weigh, similar to T4 DNA ligase. 
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N-terminal amino acid sequencing of the proteolyzed fusion 
protein was performed on a Precise 494 protein sequencer (PE 
Applied Biosystems, Foster City, CA). 

Temperature sensitivity of the Mth RIR1 intein 

The cleavage and/or splicing activity of the Mth RIR1 
intein was more proficient when protein synthesis was 
induced at 15°C than when the induction temperature was 
raised to 37°C (Figure 3). The effect temperature has on the 
Mth RIR1 represents a way to control the activity of this 
intein for use in controlled splicing or cleavage reactions. 
Replacement of Pro-i with a G.y and Cys^ with a Ser resulted 
in a double mutant, the pMRB9GS construct, which showed only 
/„ vivo C-terminal cleavage activity when protein synthes.s 
was induced at 15°C but not at 370 C . Another double mutant, 
the P MRB9GA construct, displayed slow cleavage, even at 
1 5 o C , which allowed the accumulation of substantial amounts 
of the precursor protein and showed potential for use as a C- 
terminal cleavage construct for protein purification. 



